Noise robust speech parameterization based on joint wavelet packet decomposition and autoregressive modeling

نویسندگان

Bojan Kotnik

Zdravko Kacic

Bogomir Horvat

چکیده

In this paper a noise robust feature extraction algorithm using joint wavelet packet decomposition (WPD) and an autoregressive (AR) modeling of the speech signal is presented. In opposition to the short time Fourier transform (STFT) based time-frequency signal representation, a computationally efficient WPD can lead to better representation of non-stationary parts of the speech signal (consonants). The vowels are well described with an AR model like in LPC analysis. The separately extracted WPD and AR based features are combined together with the usage of modified principal component analysis (PCA) and voiced/unvoiced decision to produce final output feature vector. The noise robustness is improved with the application of the proposed wavelet based denoising algorithm with the modified soft thresholding procedure and the voice activity detection. Speech recognition results on Aurora 3 databases show performance improvement of 47.6 % relative to the standard MFCC front-end.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comprehensive Noise Robust Speech Parameterization Algorithm Using Wavelet Packet Decomposition-Based Denoising and Speech Feature Representation Techniques

This paper concerns the problem of automatic speech recognition in noise-intense and adverse environments. The main goal of the proposed work is the definition, implementation, and evaluation of a novel noise robust speech signal parameterization algorithm. The proposed procedure is based on time-frequency speech signal representation using wavelet packet decomposition. A new modified soft thre...

متن کامل

A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)

Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...

متن کامل

A Generalized Time–Frequency Subtraction Method for Robust Speech Enhancement Based on Wavelet Filter Banks Modeling of Human Auditory System

We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-tonoise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized tim...

متن کامل

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

Robust Speech Perception Hashing Authentication Algorithm Based on Spectral Subtraction and Multi-feature Tensor

In order to make the speech perception hashing authentication algorithm has strong robustness and discrimination to content preserving operations and speech communication under the common background noise, a new robust speech perceptual hashing authentication algorithm based on spectral subtraction and multi-feature tensor was proposed. The proposed algorithm uses spectral subtraction method to...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2003

Noise robust speech parameterization based on joint wavelet packet decomposition and autoregressive modeling

نویسندگان

چکیده

منابع مشابه

A Comprehensive Noise Robust Speech Parameterization Algorithm Using Wavelet Packet Decomposition-Based Denoising and Speech Feature Representation Techniques

A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)

A Generalized Time–Frequency Subtraction Method for Robust Speech Enhancement Based on Wavelet Filter Banks Modeling of Human Auditory System

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Robust Speech Perception Hashing Authentication Algorithm Based on Spectral Subtraction and Multi-feature Tensor

عنوان ژورنال:

اشتراک گذاری